Korpus: tat_news_2005-2011

Weitere Korpora

2.2.12 Typical Prefixes and Suffixes

Typical prefixes and suffixes of words of length 1 and 2 using Levenshtein

Common Prefix
Prefix Count Percent
к- 257 0.0458
с- 211 0.0376
Т- 147 0.0262
А.- 134 0.0239
б- 119 0.0212
Р.- 104 0.0185
М.- 84 0.0150
1- 75 0.0134
җ- 67 0.0119
ч- 67 0.0119
В.- 56 0.0100
И.- 51 0.0091
2- 50 0.0089
ре- 49 0.0087
Г.- 46 0.0082
үз- 42 0.0075
-- 37 0.0066
Н.- 37 0.0066
С.- 35 0.0062
ко- 35 0.0062
Common Suffix
Suffix Count Percent
4808 0.8569
3091 0.5509
2289 0.4079
2221 0.3958
-ны 2216 0.3949
-да 2133 0.3801
-га 1942 0.3461
-не 1457 0.2597
-на 1441 0.2568
-дә 1432 0.2552
-гә 1305 0.2326
-ын 1250 0.2228
1094 0.1950
-ың 951 0.1695
-нә 944 0.1682
935 0.1666
-гы 855 0.1524
-ен 736 0.1312
-ка 690 0.1230
-ең 626 0.1116
Ratio of Suffixes/Prefixes
Ratio
12.2877
Word Pairs with 2>=Levenshtein among top words
Top words Count
100 18
300 92
1000 598
3000 3742
10000 25732
30000 136282
78047 561104


Word Pairs with 2>=Levenshtein among top words


Gnuplot diagram

5993 msec needed at 2017-10-22 15:11